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Abstract — This article proposes the first known algorithm 
that achieves a constant-factor approximation of the minimum 
length tour for a Dubins' vehicle through n points on the plane. 
By Dubins' vehicle, we mean a vehicle constrained to move at 
constant speed along paths with bounded curvature without 
reversing direction. For this version of the classic Traveling 
Salesperson Problem, our algorithm closes the gap between 
previously established lower and upper bounds; the achievable 
performance is of order n 2//3 . 

I. Introduction 

The Traveling Salesperson Problem (TSP) with its vari- 
ations is one of the most widely known combinatorial 
optimization problems. While extensively studied in the 
literature, these problems continue to attract great interest 
from a wide range of fields, including Operations Research, 
Mathematics and Computer Science. The Euclidean TSP 
(ETSP) [1], [2] is formulated as follows: given a finite point 
set P in K 2 , find the minimum-length closed path through 
all points in P. It is quite natural to formulate this problem 
in context of Dubins' vehicle, i.e., a non-holonomic vehicle 
that is constrained to move along paths of bounded curvature, 
without reversing direction. The focus of this article is the 
analysis of the TSP for Dubins' vehicle; we shall refer to it 
as DTSP 

Exact algorithms, heuristics as well as polynomial-time 
constant factor approximation algorithms are available for 
the Euclidean TSP, see [3], [4], [5]. It is known that non- 
metric versions of the TSP are, in general, not approximable 
in polynomial time [6]. Furthermore, unlike most other 
variations of the TSP, it is believed that the DTSP cannot be 
formulated as a problem on a finite-dimensional graph, thus 
preventing the use of well-established tools in combinatorial 
optimization. On the other hand, it is reasonable to expect 
that exploiting the geometric structure of Dubins' paths one 
can gain insight into the nature of the solution, and possibly 
provide polynomial-time approximation algorithms. 

A fairly complete picture is available for the minimum- 
time point-to-point path planning problem for Dubins' ve- 
hicle, see [7] and [8]. However, the DTSP seems not to 
have been studied as extensively. In [9], some results for the 
worst case tours of DTSP were provided. A lower bound 
on the expected cost of a stochastic DTSP visiting ran- 
domly generated points was provided in [10]. Here, we shall 
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specifically concentrate on the case when the target points 
in the environment are generated stochastically according 
to a uniform distribution. We shall refer to such a problem 
as stochastic DTSP. In this context the first algorithm with 
asymptotic sub-linear cost was proposed in [1 1]; an improved 
algorithm was proposed in [12]. 

The motivation to study the DTSP arises in robotics 
and uninhabited aerial vehicles (UAVs) applications, e.g., 
see [13], [14], [15]. In particular, we envision applying our 
algorithm to the setting of an UAV monitoring a collection of 
spatially distributed points of interest. Additionally, from a 
purely scientific viewpoint, it appears to be of general interest 
to bring together the work on Dubins' vehicle and that on 
TSP. Some concrete results along these lines have been 
obtained in [9] and in [11] where an algorithm to guarantee 
sub-linear cost for the stochastic DTSP was proposed. UAV 
applications also motivate us to study the Dynamic Traveling 
Repairperson Problem (DTRP), in which the aerial vehicle 
is required to visit a dynamically changing set of targets. 
This problem was introduced by Bertsimas and van Ryzin 
in [16] and then decentralized policies achieving the same 
performances were proposed in [13]. However, as with the 
TSP, the study of DTRP in context of Dubins' vehicle has 
eluded attention from the research community. 

The contributions of this article are twofold. First, we 
propose an algorithm for the stochastic DTSP through a point 
set P, called the Recursive Bead-Tiling Algorithm, 
based on a geometric tiling of the plane, tuned to the 
Dubins' vehicle dynamics, and a strategy for the vehicle 
to service targets from each tile. Second, we obtain an 
upper bound on the stochastic performance of the proposed 
algorithm and thus also establish a similar bound on the 
stochastic DTSP. The upper bound on the performance 
of the Recursive Bead-Tiling Algorithm belongs to 
0(n 2 / 3 ) with high probability, and it is known that the 
lower bound on the achievable performance belongs to 
f2(n 2 / 3 ). The algorithm we introduce in this article is the 
first known algorithm providing a provable constant-factor 
approximation to the DTSP optimal solution. 

Notation 

Here we collect some concepts that will be required in 
the later sections. A Dubins' vehicle is a planar vehicle that 
is constrained to move along paths of bounded curvature, 
without reversing direction and maintaining a constant speed. 
Accordingly, we define a feasible curve for Dubins ' vehicle 
or a Dubins' path, as a curve that is twice differentiable 
almost everywhere, and such that the magnitude of its 



curvature is bounded above by 1/p, where p > is the 
minimum turn radius. 

Let P = {pi, ... ,p n } be a set of 7i points in a compact 
region Q C I 2 and V n be the collection of all point sets 
P d Q with cardinality n. Let ETSP(P) denote the cost of 
the Euclidean TSP over P, i.e., the length of the shortest 
closed path through all points in P. Correspondingly, let 
DTSP P (P) denote the cost of the Dubins' TSP over P, i.e., 
the length of the shortest closed Dubins' path through all 
points in P. In what follows, p £ R + is take constant, and 
we study the dependence of DTSP p : V n — ► M+ on n. 

For /, g : N — > E, we say that / £ 0(g) (respectively, 
/ £ fi(g)) if there exist N £ N and k £ M + such that 
|/(AOI < for a11 N > N (respectively, |/(JV)| > 

k\g(N)\ for all N > N ). If / £ 0(g) and / £ n(g), then 
we use the notation / £ 0(g). 

II. The stochastic DTSP 

In [9], a simple heuristics, the ALTERNATING ALGO- 
RITHM for the Dubins' TSP for a given point set was 
proposed. The length of tour generated by this algorithm 
was also characterized and it was shown that it belongs to 
£l(y/n) and 0(n). It was also shown that this simple policy 
performs well when the points to be visited by the tour are 
chosen in an adversarial manner. However, it is reasonable to 
argue that this algorithm might not perform very well when 
dealing with a random distribution of the target points. In 
particular, one can expect that when n points are chosen 
randomly, the cost of the DTSP increases sub-linearly with 
n, i.e., that the average length of the path between two points 
decreases as n increases. In this section, we consider the 
scenario when n target points are stochastically generated in 
Q according to a uniform distribution. A novel algorithm, 
the Bead-Tiling Algorithm was proposed in [11] to 
service these points in such a way that its tour length grew 
sub-linearly with the number of points asymptotically with 
high probability, where an event is said to occur with high 
probability if the probability of its occurence approaches 1 as 
n — ► +oo. Here, we present a novel version of this strategy 
in the form of the Recursive Bead-Tiling Algorithm 
and characterize its performance. 

We assume that the environment Q is a rectangle of width 
W and height H; different choices for the shape of Q affect 
our conclusions only by a constant. In what follows we select 
a reference frame whose two axes are parallel to the sides of 
Q. Let n target points be generated stochastically according 
to uniform distribution in the region Q. Let V = (p\ , . . . , p n ) 
be the locations of these target points. 

A. A lower bound 

First, we summarize a result from [10], that provides a 
lower bound on the expected length of the stochastic DTSP 

Theorem 2.1: (Lower bound on stochastic DTSP) For all 
p > 0, the expected cost of a stochastic DTSP visiting 
a set P of n uniformly-randomly-generated points in Q, 
E[DTSP P (P)] belongs to n(n 2 / 3 ). 



B. A constructive upper bound 

In [11], a novel algorithm, the Bead-Tiling ALGO- 
RITHM, to compute Dubins' path through a point set in 
the region Q was proposed. In this section, we design the 
Recursive Bead-Tiling Algorithm based on the ideas 
of the Bead-Tiling Algorithm. We will show that the 
proposed algorithm provides a constant factor approximation 
to the optimal DTSP with high probability. We start by 
describing some useful geometric objects. 

1 ) The basic geometric construction: Consider two points 
p_ = (— 1, 0) and p + = (1, 0) on the plane, with I < p, 
and construct the region B p (l) as detailed in Figure Q in 




Fig. 1. Construction of the "bead" B p (l). The figure shows how the upper 
half of the boundary is constructed, the bottom half is symmetric. 

the following, we will refer to such regions as beads. The 
region B p (l) enjoys the following asymptotic properties as 

(l/p) - 0+: 

(PI) The maximum "thickness" of the region is equal to 



w(l) = Ap 1 



1 - 



2p 



(P2) The area of B p (l) is equal to 



Ave a [B p (l)] = lHl) = Y p +0 (ji 

(P3) For any p £ B p , there is at least one Dubins' path 
7 P through the points {p-,p,p+}, entirely contained 
within B p , and such that its length is at most 

/ I \ ( I 2 

Length(7„) < 4parcsin — ) = 21 + o I 

\2pJ VP 

These facts are verified using elementary planar geometry. 

2) Periodic tiling of the plane: An additional property of 
the geometric shape introduced above is that the plane can 
be periodically tiled by identical copies of B p (l), for any 
I £ (0,2p). (Recall that a tiling of the plane is a collection 



of sets whose intersection has measure zero and whose union 
covers the plane.) Let 

_ Arca[ff p (Q] 
Area[Q] ' 

Consider a bead B entirely contained in Q; the probability 
that the i-th point is sampled in B is equal to /i. Furthermore, 
the probability that exactly k out of the n points are sampled 
in B has a binomial distribution, i.e., indicating with ub the 
total number of points sampled in B, 

Prob[n B = k\n samples] = f™J //(l - n) n ~ k ■ (1) 

Choose [i as a function of n, in such a way that v = n/i(n) 
is a constant. In such a case, the limit for large n of the 
binomial distribution Q is the Poisson distribution of mean 
v, that is, 

v k _ 

lim Prob[riB = k\n samples] = — e " (2) 

C. 77ze algorithm 

Consider a tiling of the plane such that Area[S j0 (i)] = 
WH/(2n); in such a case, /i = l/(2n), and v = 1/2. (Note 
that this implies that n must be large enough that £ < 2p.) 
Furthermore, the tiling is chosen is such a way that it is 
aligned with the sides of Q, see Figure [2] 




Fig. 2. Sketch of tiling of the region before the first phase of the 
Recursive Bead-Tiling Algorithm. 

The proposed algorithm will consist of a sequence of 
phases; during each of these phases, a Dubins tour (i.e., a 
closed path with bounded curvature) will be constructed that 
"sweeps" the set Q. 

In the first phase, a Dubins tour is constructed with the 
following properties: 

(i) it visits all non-empty beads once, 

(ii) it visits all rows 1 in sequence top-to-down, alternat- 
ing between left-to-right and right-to-left passes, and 
visiting all non-empty beads in a row, 

(iii) when visiting a non-empty bead, it services at least 
one target in it. 

In order to visit the outstanding targets, a new phase is 
initiated. In this phase, instead of considering single beads, 

'A row is a maximal string of beads with non-empty intersection with 

Q. 



we will consider "meta-beads" composed of two beads each, 
as shown in Figure 0, and proceed in a similar way as 
the first phase, i.e., a Dubins tour is constructed with the 
following properties: 

(i) the tour visits all non-empty meta-beads once, 

(ii) it visits all (meta-bead) rows in sequence top-to- 
down, alternating between left-to-right and right-to-left 
passes, and visiting all non-empty meta-beads in a row, 

(iii) when visiting a non-empty meta-bead, it services at 
least one target in it. 

This process is iterated at most log 2 n + 1 times, and 
at each phase meta-beads composed of two neighboring 
meta-beads from the previous phase are considered; in other 
words, the meta-beads at the i-th phase are composed of 2 i ~ 1 
neighboring beads. After the last phase, the leftover targets 
will be visited using, for example, a greedy strategy, or the 
Alternating Algorithm. 

We have the following result, which we prove using a 
technique similar to that developed in [17]. 

Lemma 2.2: Let P G V n be uniformly randomly gener- 
ated in Q. Then, the number of unvisited targets after the 
last phase of the Recursive Bead-Tiling Algorithm 
over P belongs to O(logn) with high probability. 

Proof: Associate a unique identifier to each bead, e.g., 
integers between 1 and 2n; call such a set of identifiers T. Let 
b(t) G T be the identifier of the bead in which the t-th target 
is sampled, and let h(t) G N be the phase at which the i-th 
target is visited. Without loss of generality, we will assume 
that if b(ti) = b{t 2 ), and t\ < t 2 , then h(ti) < h(t 2 ). 

Indicate with vi (t) the number of beads that contain unvis- 
ited targets at the inception of the i-th phase, computed after 
the insertion of the i-th target. Furthermore, let rrii be the 
number of i-th phase meta-beads (i.e., meta-beads containing 
2 l_1 neighboring beads) with a non-empty intersection with 
Q. Clearly, Vi(t) < Vi(n), rrii < 2m I+ i, and v\(n) < n < 
mi/ 2 with certainty. 

The t-th target will not be visited during the first phase if 
it is sampled in a bead that already contains other targets. In 
other words, 

^ r, / % / m vi(t) vi(n) 1 

Pr[h(t) > 2\ Vl (t)} = ^-l < ->±-L < -. 

Similarly, the t-th target will not be visited during the i-th 
phase if (i) it has not been visited before the i-th pass, and (ii) 
it belongs to a meta-bead that already contains other targets 
not visited before the i-th phase: 

Pr [h(t) > i + l\(vi(t - 1), - l),vx(t - 1))] 
= Pr[h(t) > i + l\h(t) > i,Vi(t- 1)] • 
Pr[fc(*) > i\{vi-i(t-l),..., Vl (t-l))] 

< V ^ - l) Pr[h(t) > t|(tii-i(< - 1), ■ . . ,«i(t - 1))] 
rrii 

-n^n^-(£)V>- 

j=i 1 j=i \ j j=i 

(3) 



Phase 1 



Phase 2 



Phase 3 



Fig. 3. Sketch of "meta-beads" at successive phases in the recursive bead tiling algorithm. 



For a fixed i > 1, define a sequence of binary random 
variables 

f 1, if h(t) > i + 1 and Vi(t - 1) < fan; 
* \ 0, otherwise. ( ' 

In other words, Y t is equal to 1 if the t-th target is not visited 
within the first i phases despite the fact that the number of 
beads still containing unvisited target at the inception of the 
i-th phase is less than fan; the values {fa} will be defined 
shortly. 

Even though the random variable Y t depends on the targets 
generated before the t-th target, the probability that it takes 
the value 1 is bounded by 

i 

Pr[Y t = l\b(l), 6(2), . . . , 6(t - 1)] < 2^ J[ fa =: Pi , 

3 = 1 

regardless of the actual values of 6(1), . . . , b(t — 1). 

It is known (e.g., see [17]) that if the random variables 
Yt satisfy such a condition, the sum Y t is stochastically 
dominated by a binomially distributed random variable, 
namely, 



Pr 

In particular, 



< Pr[B(n,pi) > k] 



Pr 



Y J Y t > 2n Pl 



,t=i 



<Pr[B(n, Pl )>2n Pl ]<2- n P>/ 3 , 



(5) 

where the last inequality is obtained using Chernoff's bound. 
Let us define the sequence {fa} through the recursion 



fa = 
fa+i = 

which leads to 



1. 

2ft 



_ »(»-3) 

2~ — 



■ 1 IIA = 



i-2 Q 1 



fa, 



fa = 2 



l-i 



(6) 



With the above definition in mind, © can be rewritten as 



Pr 



t=i 



<Pr[B(n, Pi ) > fa +1 n] 

< 2-^+ in / 6 = 2"i^ 



which is less than 1/n 2 for i < i*(n) 
log 2 log 2 n — log 2 6J < log 2 n. Note that 

log 2 n 



[log 2 n 



fa < 12 



Vi > i*(n). 



(7) 



Let Si be defined as the event that Vi(n) < fan. Note that 
if £j is true, then Vi+\{n) < 5Z™=i tne right hand side 
represents the number of targets that will be visited after the 
i-th phase, whereas the left hand side counts the number of 
beads containing such targets. We have, for all i < i*(n): 



Px[v i+1 > fa +1 n\£ t ]-PY%] < Pr 
that is , 

Pr[-.£ + i|£] < - 



^2Y t >fa 



+ 171 



< 



1 



Pr[^]' 

and thus (recall that E\ is true with certainty): 

PrhA+i] < i+Pr[^i] < -\. 

In other words, for all i < i*(ri), Vi(n) < fan with high 
probability. 

Let us turn our attention to the phases such that i > i*(n). 
The total number of targets visited after the i*-th phase 
is dominated by a binomial variable B(n, 12 log 2 n/n); in 
particular, 



Prkv +1 > 241og 2 n|£' i .]-Pr[£ i .] < Pr 



< Pr[B(n, 12 log 2 n/n) > 241og 2 n] < 2~ 12los ? n ; 
dealing with conditioning as before, we get 

Pr [ Vi . +1 > 241og 2 n) < ± + Pr^,] < ± + 

In other words, the number of targets that will be left after 
the i*-th phase will be bounded by a logarithmic function of 
n with high probability. ■ 

D. A bound on the length of the solution 

What we know at this point is that after a sufficiently 
large number of phases, almost all targets will be visited, 
with high probability. The key point is to recognize that the 
length of each phase is decreasing at such a rate that the sum 
of the lengths of all the phases remains bounded. We first 
state and prove the following result which characterizes the 



length of Dubins' path required to execute the RECURSIVE 
Bead-Tiling Algorithm. 

Lemma 2.3: (Length of path for the RECURSIVE BEAD- 
Tiling Algorithm) Let P e V n be uniformly randomly 
generated in Q. Then the length of Dubins' path required 
to execute logn phases of the Recursive Bead-Tiling 
Algorithm over P belongs to 0(n 2 / 3 ). 

Proof: Let Li denote the upper bound on the length of 
the path for the i th phase. Then one can see that 



r io S 2(n) 1 

( Li < 3 L2j-\. 
i=l j=i 



log 2 (ri) 



Let us first compute the length of a pass, in either 
direction. The number of beads traversed will be no more 
than 

(8) 



20 L 



Cl i 
— 713 

2i 



where c\ 



w 



is a constant. The length of Dubins' 



y P WH 

path contained entirely within a meta-bead at the (2j — l)-th 
phase is less than 2 J ~ 1 (2l n + o(lfj). Hence, the total path 
length per pass will be bounded by: 



^pass,2j — 1 



< 2 1 ~ 1 (2L 



—n 3 + 1 

,23 

= Cl l n ni + 2? In + jnh(ll) + 2^ 1 o(? I 2 i ) 

(9) 

as l n — > + . 

The cost of a U-turn, i.e., the length of the path needed 
to reverse direction and move to the next row of beads, is 
bounded by 

7 7 I 2 

iu-turn,2j-l < -IT p+2^- 2 w(l n ) = ^P+^ {f- p 



The total number of passes will be at most 



iV, 



pass,2j — 1 



H 



V- 2 w{l n ) 



< 



23-H 2 



1. 



(10) 



(ID 



The cost of closing the tour is bounded by a constant, say 

-^closure, 2j — l <4(W + Http). (12) 
Concluding, the total path length will be bounded by 

L2j-1 = ^Vpass,2j-l(-^pass,2j-l+iu-turn,2j-l)+-C'closure,2j-l- 

(13) 

Substituting eqns. (|9), JTOi . (TITl and ( fT2t in eqn. fl3t . one 
can find constants k\ , k-2 and such that 



L 



2j-i < ki(l+n^o(n 3)+nz)+k 2 2 : > (n 3 + o(n 

no(n 3 ) + n 3 ) . 



k 3 2-i(l 



From this expression for the length of path during odd 
phases, one can conclude that 



E 

!=1 



L, < 3 £ L 2 i-i G 0(n 2/3 ). 

3=1 



Based on the results obtained so far, we are now ready to 
state an upper bound on the length of the path traveled by 
Dubins' vehicle to service all the n targets while executing 
the Recursive Bead-Tiling Algorithm followed by 
the Alternating Algorithm; let L RccB ta,p(-P) repre- 
sent the corresponding quantity. 

Theorem 2.4: (Upper bound on the length of the total 
path) Let P £ V n be uniformly randomly generated in 
Q. For all p > 0, L RccB ta, p (P) G 0(n 2 / 3 ) with high 
probability. 



Proof: By Lemma 12.31 the length of path to exe- 
cute the Recursive Bead-Tiling Algorithm belongs 
to 0(n 2 / 3 ). From Theorem 12.21 the number of targets 
remaining at the end of the RECURSIVE Bead-Tiling 
Algorithm belongs to O(logn) with high probability. 
These remaining points can be serviced by any greedy policy 
or some heuristics (e.g., ALTERNATING ALGORITHM [9]) 
in O(logn) time. The statement of the theorem follows 
immediately. ■ 
Combining results from Theorem 12.11 and Theorem 12.41 
one can conclude that the Recursive Bead-Tiling Al- 
gorithm is a constant factor approximation to the optimal 
DTSP with high probability. 

III. Conclusions 

In this article, we have studied the TSP problem for vehi- 
cles that follow paths of bounded curvature in the plane. For 
the stochastic setting, we have obtained upper bounds that 
are within a constant factor of the lower bound established 
in literature [10]; the upper bounds are constructive in the 
sense that they are achieved by two novel algorithms. It 
is interesting to compare our results with the Euclidean 
setting (i.e., the setting in which curves do not have curvature 
constraints). For a given compact set and a point set P of 
n points, it is known [1], [2] that the ETSP(P) belongs 
to Q(y/n). This is true for both stochastic and worst-case 
settings. In this article, we showed that, given a fixed p > 0, 
the stochastic DTSP p (P) belongs to 6(n 2/3 ) with high 
probability. It is known [9] that the worst-case DTSP p (P) 
belongs to 9(n). 

In the future, we plan to perform extensive simulations to 
support the results obtained in this article. Future directions 
of research include study of centralized and decentralized 
versions of the DTRP and general task assignment and 
surveillance problems for various non-holonomic vehicles. 
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